NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Impact of an adaptive dialog that uses natural language processing to detect students’ ideas and guide knowledge integration.

https://doi.org/10.1037/edu0000902

Gerard, Libby; Holtman, Marlen; Riordan, Brian; Linn, Marcia C (January 2025, Journal of Educational Psychology)

Full Text Available
Adaptive Dialog to Support Student Understanding of Climate Change Mechanism and Who is Most Impacted

https://doi.org/10.22318/icls2023.681776

Bradford, Allison; Li, Weiying; Riordan, Brian; Steimel, Kenneth; Linn, Marcia C (October 2023, International Society of the Learning Sciences)
Blikstein, P; Van_Aalst, J; Kizito, R; Brennan, K (Ed.)
How Does an Adaptive Dialog Based on Natural Language Processing Impact Students From Distinct Language Backgrounds?

https://doi.org/10.22318/icls2023.921177

Holtmann, Marlen; Gerard, Libby; Li, Weiying; Linn, Marcia C; Riordan, Brian; Steimel, Ken (October 2023, International Society of the Learning Sciences)
Blikstein, P; Van_Aalst, J; Kizito, R; Brennan, K (Ed.)
This study takes advantage of advances in Natural Language Processing (NLP) to build an idea detection model that can identify ideas grounded in students’ linguistic experiences. We designed adaptive, interactive dialogs for four explanation items using the NLP idea detection model and investigated whether they similarly support students from distinct language backgrounds. The curriculum, assessments, and scoring rubrics were informed by the Knowledge Integration (KI) pedagogy. We analyzed responses of 1,036 students of different language backgrounds taught by 10 teachers in five schools in the western United States. The adaptive dialog engages students from both monolingual English and multilingual backgrounds in incorporating additional relevant ideas into their explanations, resulting in a significant improvement in student responses from initial to revised explanations. The guidance supports students in both language groups to progress in integrating their scientific ideas.
more » « less
Explaining Thermodynamics: Impact of an Adaptive Dialog Based on a Natural Language Processing Idea Detection Model

https://doi.org/10.22318/icls2023.199424

Li, Weiying; Gerard, Libby; Lim-Breitbart, Jonathan; Bradford, Allison; Linn, Marcia C; Riordan, Brian; Steimel, Kenneth (October 2023, International Society of the Learning Sciences)
Blikstein, P; Van_Aalst, J; Kizito, R; Brennan, K (Ed.)
We explored how Natural Language Processing (NLP) adaptive dialogs that are designed following Knowledge Integration (KI) pedagogy elicit rich student ideas about thermodynamics and contribute to productive revision. We analyzed how 619 6-8th graders interacted with two rounds of adaptive dialog on an end-of-year inventory. The adaptive dialog significantly improved students’ KI levels. Their revised explanations are more integrated across all grades, genders, and prior thermodynamics experiences. The dialog elicited many additional ideas, including normative ideas and vague reasoning. In the first round, students refined their explanation to focus on their normative ideas. In the second round they began to elaborate their reasoning and add new normative ideas. Students added more mechanistic ideas about conductivity, equilibrium, and the distinction between how an object feels and its temperature after the dialog. Thus, adaptive dialogs are a promising tool for scaffolding science sense-making.
more » « less
Analyzing automated content scoring for knowledge integration in science explanations using saliency maps

Riordan, Brian (April 2021, 2021 Annual Meeting of the American Educational Research Association)
null (Ed.)
Models for automated scoring of content in educational applications continue to demonstrate improvements in human-machine agreement, but it remains to be demonstrated that the models achieve gains for the “right” reasons. For providing reliable scoring and feedback, both high accuracy and construct coverage are crucial. In this work, we provide an in-depth quantitative and qualitative analysis of automated scoring models for science explanations of middle school students in an online learning environment that leverages saliency maps to explore the reasons for individual model score predictions. Our analysis reveals that top-performing models can arrive at the same predictions for very different reasons, and that current model architectures have difficulty detecting ideas in student responses beyond keywords.
more » « less
Full Text Available
Analyzing saliency in neural models for scoring content in science explanations

Riordan, Brian (November 2020, BlackboxNLP: Analyzing and Interpreting Neural Networks for NLP)
null (Ed.)
Models for automated scoring of content in educational applications continue to demonstrate improvements in human-machine agreement, but it remains to be demonstrated that the models achieve gains for the “right” reasons. For providing reliable scoring and feedback, both high accuracy and connecting scoring decisions to scoring rubrics are crucial. We provide a quantitative and qualitative analysis of automated scoring models for science explanations of middle school students in an online learning environment that leverages saliency maps to explore the reasons for individual model score predictions. Our analysis reveals that top-performing models can arrive at the same predictions for very different reasons, and that current model architectures have difficulty detecting ideas in student responses beyond keywords.
more » « less
Full Text Available
Probing Saliency in Short Answer Scoring Models for Science Explanations

Riordan, Brian; Bichler, Sarah; Bradford, Allison; Linn, Marcia C. (November 2020, New York Academy of Sciences Natural Language, Dialog and Speech Symposium)
null (Ed.)
Recent work on automated scoring of student responses in educational applications has shown gains in human-machine agreement from neural models, particularly recurrent neural networks (RNNs) and pre-trained transformer (PT) models. However, prior research has neglected investigating the reasons for improvement – in particular, whether models achieve gains for the “right” reasons. Through expert analysis of saliency maps, we analyze the extent to which models attribute importance to words and phrases in student responses that align with question rubrics. We focus on responses to questions that are embedded in science units for middle school students accessed via an online classroom system. RNN and PT models were trained to predict an ordinal score from each response’s text, and experts analyzed generated saliency maps for each response. Our analysis shows that RNN and PT-based models can produce substantially different saliency profiles while often predicting the same scores for the same student responses. While there is some indication that PT models are better able to avoid spurious correlations of high frequency words with scores, results indicate that both models focus on learning statistical correlations between scores and words and do not demonstrate an ability to learn key phrases or longer linguistic units corresponding to ideas, which are targeted by question rubrics. These results point to a need for models to better capture student ideas in educational applications.
more » « less
Full Text Available
An empirical investigation of neural methods for content scoring of science explanations

https://doi.org/10.18653/v1/2020.bea-1.13

Riordan, Brian; Bichler, Sarah; Bradford, Allison; King Chen, Jennifer; Wiley, Korah; Gerard, Libby; C. Linn, Marcia (January 2020, Proceedings of the Fifteenth Workshop on Innovative Use of NLP for Building Educational Applications)

With the widespread adoption of the Next Generation Science Standards (NGSS), science teachers and online learning environments face the challenge of evaluating students' integration of different dimensions of science learning. Recent advances in representation learning in natural language processing have proven effective across many natural language processing tasks, but a rigorous evaluation of the relative merits of these methods for scoring complex constructed response formative assessments has not previously been carried out. We present a detailed empirical investigation of feature-based, recurrent neural network, and pre-trained transformer models on scoring content in real-world formative assessment data. We demonstrate that recent neural methods can rival or exceed the performance of feature-based methods. We also provide evidence that different classes of neural models take advantage of different learning cues, and pre-trained transformer models may be more robust to spurious, dataset-specific learning cues, better reflecting scoring rubrics.
more » « less
Full Text Available

Search for: All records